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Abstract 

Let 7a be the compact manifold of real symmetric tridiagonal matrices 
conjugate to a given diagonal matrix A with simple spectrum. We introduce 
bidiagonal coordinates, charts defined on open dense domains forming an ex- 
plicit atlas for 7a. In contrast to the standard inverse variables, consisting of 
eigenvalues and norming constants, every matrix in 7a now lies in the inte- 
rior of some chart domain. We provide examples of the convenience of these 
new coordinates for the study of asymptotics of isospectral dynamics, both 
for continuous and discrete time. 

Keywords: Jacobi matrices, tridiagonal matrices, norming constants, Toda flows, 
QR algorithm. 
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1 Introduction 

Let A be a real diagonal matrix with simple spectrum and T\ be the manifold of 
real, symmetric, tridiagonal matrices having the same spectrum as A. The purpose 
of this paper is to present an explicit atlas for T\: the charts in the atlas define the 
bidiagonal coordinates on open dense subsets of T\. As is familiar to numerical ana- 
lysts, many algorithms to compute spectra operate by iteration on Jacobi matrices, 
yielding approximations of reduced tridiagonal matrices. Given a limit point p for 
such iterations, there is a chart in the atlas containing p in its interior, reducing 
the study of asymptotic behavior to a matter of local theory. The construction 
of the atlas was motivated by our study of the asymptotics of the Wilkinson shift 
iteration: we use bidiagonal coordinates to prove that this well known algorithm 
deflates cubically for generic spectra ([TO]) but only quadratically for certain initial 
conditions f [TT]). 

Jacobi matrices are frequently parameterized by its (simple) eigenvalues and the 
vector w of (positive) first coordinates of its normalized eigenvectors, the norming 
constants. An algorithm to recover a Jacobi matrix from these data was known 
to Stieltjes ([I3]). From the procedure, one learns that the set C T\ of Jacobi 
matrices with prescribed simple spectrum is diffcomorphic to R" _1 . Norming con- 
stants break down at the boundary of Ja, and new techniques are required to study 
its closure Ja within the space of symmetric matrices. In [19], was proved to be 
homeomorphic to a convex polytope "Pa and the boundary of J7a was described as a 
union of cells of reduced tridiagonal matrices. But one may go beyond: by making 
all possible changes of sign along the off-diagonal entries (i + l,i) of the matrices in 
J7a, one obtains 2 n_1 copies of J7a, which glue along their boundaries to form the 
compact manifold 7a. 

There are significant theoretical advantages for considering the manifold 7a 
instead of J7a or even Ja- In algorithms to compute the spectrum of Jacobi matrices, 
the limit point is usually a reduced matrix: if the limit point lies in the interior of 



1 



the domain, asymptotic behavior becomes amenable to local theory. Furthermore, 
signs of off-diagonal entries are often dropped in such algorithms. This procedure, 
which is computationally practical, may introduce theoretical complications akin 
to inserting absolute values on a smooth function. Enlarging the domain of such 
iterations to 7a may allow for a choice of signs which respects smoothness: one is 
then entitled to use Taylor expansions in the local study of the iteration. 

The bidiagonal coordinates j3£, i = 1, . . . , n— 1, which play the role of generalized 
norming constants, are defined on a cover 1l\ of open dense subsets of 7a indexed by 
permutations tt e S„. On each U\, the bidiagonal coordinates give rise to a chart of 
the atlas, i.e., a diffeomorphism to : U\ — > R" _1 . The underlying construction 
is easy to describe. A matrix M is LU -positive if it admits a (unique) factorization 
AI = LU where L is lower unipotent (i.e., lower triangular with unit diagonal) and 
U is upper triangular with positive diagonal entries. Set A = diag(Ai, . . . , A n ), Ai < 
■ • • < A n and, for 7r g S n , let A 71 " = diag(A 7r ( 1 ), . . . , A 7r („)). A matrix T G 7X belongs 
to U\ if it admits a diagonalization T = Q* A^Q^ for some orthogonal LC/-positive 
matrix Q n = L^U*; in particular, A n e U\. Now set B„ = L~ 1 A 7r £ 7r = U^TU^ 1 . 
From the formulae, is simultaneously lower triangular and upper Hessenberg, 
hence lower bidiagonal. The construction of the chart is complete: 
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It turns out that if T € 7a is unreduced then T € for all it £ S n (Lemma I3.2[) . 

As far as we know, this construction of the matrices and B x was first used 
by Terwilliger in his study of Leonard pairs (|17j): our matrix L n , for example, 
appears in his lemma 4.4 as E r . Carnicer and Pefia ([3]) also consider changes of 
basis leading to bidiagonal matrices in their study of oscillatory matrices. 

For Jacobi matrices, bidiagonal coordinates are, up to a multiplicative factor, 
quotients of norming constants (Proposition 13 . 6| ) : 
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Norming constants, however, yield no chart for a neighbor- 



hood of a diagonal matrix in 7a. . Bidiagonal coordinates imply that in U\ appropri- 
ate quotients of norming constants tu 7r (i+i)/'i^7r(i) admit natural smooth extensions, 
a fact discussed in [7] and [12]. There is however no satisfactory definition for the 
sign of norming constants for matrices throughout T\: this will be discussed more 
carefully at the end of Section 2. 

In the next two sections, we consider the theoretical setup. Section 2 contains 
some basic facts about Jacobi matrices and norming constants, presented using the 
concept of LJ7-positivity so as to prepare the reader to the discussion of bidiagonal 
coordinates. We also collect some geometric properties of the isospectral manifold 
7a: the case n = 3 is taken as a detailed example. In section 3, we describe the 
domains U\ both in terms of L[/-positivity and based on a cell decomposition of 
7a. We then construct the charts tp v : U\ — > R™ -1 and their inverses <j> n : the 
bidiagonal coordinates for T £U][ are (/3f , . . . , /3£_i) = ^^{T). We also prove that 
the quotients (3f /((T)j_|_i^) are smooth strictly positive functions in IA\. 

In order to provide applications, we concentrate on two kinds of dynamics acting 
on Jacobi matrices: QR steps (Section 4) and Toda flows (Section 5). Algorithms 
to compute eigenvalues of Jacobi matrices which are related to the QR factoriza- 
tion, as well as the flows in the Toda hierarchy, admit a very simple description in 
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bidiagonal coordinates: they evolve linearly in time. From this description, limits 
at infinity (with asymptotic rates) are immediate. As a slightly more complicated 
example, we prove the cubic convergence of the Rayleigh quotient shift iteration 
using a Taylor expansion. More precisely, given A, let G(T) £ 7/V be obtained 
from T S 7X by a Rayleigh quotient step: we prove that there exist c, e > such 
that if |(T) n> „_i| < e then |(G(T)) n)n _i| < min(e, c|(T) n! „_i| 3 ). The reader should 
compare this argument with the more complicated study of the asymptotics of the 
Wilkinson's shift iteration in [10] and [IT] . We conclude the paper with the compu- 
tation of the wave and scattering maps of the standard Toda flow, a physical system 
consisting of n particles on the line under the influence of a special Hamiltonian. 
Moser (|13j) had previously computed the scattering map and Percy Deift (personal 
communication) the wave map, but our arguments are significantly different. 

We thank the comments presented by the referees, which led to a much improved 
text. The authors gratefully acknowledge support from CNPq, CAPES, IM-AGIMB 
and FAPERJ. 



2 Tridiagonal matrices 

We begin this section by sketching some classical facts about tridiagonal matrices 
([13], [14j ) in a phrasing appropriate to our purposes. Let T be the vector space 
of real, tridiagonal, n x n symmetric matrices. A matrix T 6 7" is Jacobi (resp. 
unreduced) if Ti+i^ > (resp. Ti+i^ ^ 0) for i = 1, 2, . . . , n — 1. Let J C 7" be 
the open cone of Jacobi matrices and R™ be the open cone {(xi, . . . ,x n ),X\ < ■ ■ ■ < 
x n }. The ordered spectrum map 0~ o is defined on the open set of real symmetric 
matrices with simple spectrum: a (S) = (Ai,...,A„) E lists the eigenvalues 
of S in increasing order. Let 0(n) be the group of real orthogonal matrices of 
order n. For an invertible M , write the unique QR factorization M — Q(M)R(A7), 
for Q(A/) G 0(n) and R(M) upper triangular with positive diagonal. Similarly, 
when the leading principal minors of M are invertible, write the LU factorization 
M = L(M)U(M) where L(M) is lower unipotent (i.e., lower triangular with unit 
diagonal) and U(Af) is upper triangular. 

For a permutation n S S n , consider the matrix P T with (i, j) entry equal 1 if and 
only if i = (thus P WlW2 = P-k^-k^ and P x e,: = e^,;)). For A = diag(Ai, . . . , A„), 
with Ai < • • • < A„, set 

A 7r = p- 1 AP 7r = diag(A 7r(1) ,A 7r(2) ,...,A 7r(n) ) = diag(AI,AJ,...,A^). 

Finally, let £ C 0(n) be the group of sign diagonals, i.e., matrices of the form 
£ = diag(±l,±l,...,±l). 

Definition 2.1 A square matrix M is LC/-positive if U(M) is well defined and 
the diagonal entries o/U(M) are positive. Given T € T , A = diag(<7 (T)) and a 
permutation -k , the factorization T — Q^A^Q^ is a 7r-normalized diagonalization if 
the orthogonal matrix is LU -positive. 

Equivalently, M is L£/-positive if the determinants of its leading principal minors 
are positive. The 7r-normalized diagonalization of T G T is unique if it exists. 
Indeed, two factorizations Q^A^Qi = Q\A^Q 2 yield Q2 = EQ\ for some E E £, 
Qi and Qi both L[/-positive: we must have E = I. 

We recast a standard result for our purposes. 

Theorem 2.2 The eigenvalues of a Jacobi matrix J are distinct. Given J and 
a permutation ir £ S„, J admits a (unique) n -normalized diagonalization J = 
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Q^A^Qtt where A = diag(cr ( J)). The coordinates of Q^e\ are nonzero. The map 
below is a diffeomorphism: 

Tit : J->R2 x {we MP | |MI = l,Wi > 0} 

J i * (a (J), (l(Q.)nl, \(QM, ■ • • , KQOmD) • 

The second entry w of IV (J) lists the norming constants of J. For different 
7r G 5„, the coordinates of w are merely permuted. Indeed, if J = Q*AQ and 
J = Ql^Q-n then g„ = EP~ 1 Q for some £e£. 

Proof: Simplicity of the spectrum of J and the fact that the first coordinate 
of each eigenvector is nonzero are standard facts ([II])- In other words, given a 
diagonalization J = Q*A n Q, the coordinates of w = Qe\ are nonzero. Clearly, 
the matrices Q G 0(n) for which J = Q*A 7! Q are of the form Q — EQ for a sign 
diagonal E G £, i.e., we may change signs of rows of Q. The values of | I do not 
depend on the choice of Q thus allowing us to define the smooth map r„. using any 
diagonalization (not necessarily 7r-normalized). We show that one such matrix Q is 
LCZ-positive and that is a diffeomorphism by constructing the inverse of T n . 
Construct a Vandermonde matrix V with Vy = A^.7.s and a positive diagonal 

matrix W = diag(u)i, . . . , u) n ). The well known formula for the determinant of a 
Vandermonde matrix implies that the leading principal minors of V are nonzero. 
Thus, there exists a unique sign diagonal E G £ such that EV is LCI-positive: the 
matrices EWV — WEV and Q = Q(EWV) are therefore also LZ7-positive. We 
claim that J = Q*A*Q is a Jacobi matrix. 

To prove tridiagonality, we show that = (Jej, e;} = (A^Qej, Qei) equals for 
i > j + 1. For j — 1, . . . , n, consider the columns Uj — (A^y^ 1 Eii and qj of EWV 
and Q, respectively. The Krylov subspace Kj spanned by Ui, . . . , Uj is also spanned 
by the orthonormal vectors q~i,...,qj since QR — EWV where R — H(EWV) is 
upper triangular with positive diagonal. We have A n Kj C Kj + i and therefore A^qj 
is a linear combination of qx, . . . , qj+i and {A™ qj, q\) — as needed. 

We now show that Jj+i.j > 0. The factorization QR = EWV implies that 
qi - {Ru^Ul G and similarly u l+1 - R l+ i. i+ iq l+ i G K t . Applying A 77 to the 

first relation we have A^qi — (i?^) _1 tti + i G Ki and using the second relation we 
obtain A^qi — (Ra)~ l Ri^i^iq^i G Ki. Now, since Ki J_ we have Jj+i,j = 
(A^^i, q-i+i) — (Ru)^ 1 Ri+i,i+i(qi+i, qi+i) = (Ra)~ > 0. 

Adding up, J = Q* A^Q is the 7r-normalized diagonalization of the Jacobi matrix 
J. From this construction, the inverse of r„. is smooth, completing the proof. ■ 

Let J7a C J be the set of Jacobi matrices J with diag(cr ( J)) = A: norming con- 
stants (or, more precisely, the second coordinate w of r w ) obtain a diffeomorphism 
between J7a and the positive orthant of the unit sphere {w G 1" | ||iu| | = 1, Wi > 0}. 
Let Ja be the closure of J\ in T: clearly, the boundary of J7a consists of reduced 
tridiagonal matrices with non-negative off-diagonal entries, including the n\ diago- 
nal matrices obtained by permuting the eigenvalues Ai, . . . , A„. The boundary of J\ 
is not a smooth manifold: it has a polytope-like cell structure which was described 
in [19J . We now give a more explicit description. 

For T G Jk, write a diagonalization T = Q*AQ, Q G 0(n). Consider the matrix 
T = QAQ*: this matrix is not well defined (due to the sign ambiguity in Q) but 
the diagonal of T is. Define i(T) to be the diagonal matrix coinciding with T on 
the diagonal. Also, let Va be the convex hull of the set of nl matrices A 7r , ir G S n . 

Theorem 2.3 (|2j) The map l constructed above is a homeomorphism t : —> Va 
which is a smooth diffeomorphism between interiors. Furthermore, ^(A 71 ") = A 71 " 
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The original proof of this theorem uses a result of Atiyah on the convexity of the 
image of moment maps dehned on Kahler manifolds 1 ; a more elementary proof is 
given in [T2| . We find this sequence of results to be a good example of the interplay 
between high and low roads in linear algebra, so eloquently described in |18j . 

We now present in detail the case n = 3, where Ja and Va have dimension 2. 
Take A = diag(4, 5, 7) ; Va is a hexagon contained in the plane x + y + z^A + b + 7. 
The spaces J a and Va are given in Figure [T] The polygon Va is drawn in scale; 
the drawing of Ja is schematic. 




Figure 1: The spaces Ja and Va for A = diag(4, 5, 7) 



The triples in the diagram of Ja are of two kinds. The vertices, which are 
diagonal matrices, are labelled by the three diagonal entries. The edges have stars 
in the place of a 2 x 2 block. Thus, for instance, the edge (*, *, 7) consists of matrices 
of the form 

/9 - cos 26» sin 29 \ / c s 0\ /4 0\ /c -s 0\ 
- sin20 9 + cos2(9 = -s c 5 s c 
2 \ 14/ \ 1/ \0 7/ \0 1/ 

where 8 goes from to tt/2, c = cos9 and s = s'm9. Notice that the vertices in Ja 
and Va have different adjacencies, in accordance with Theorem 12.31 A^ 1 and A'*' 2 
are adjacent in Va if and only if A T i and A 7 ^ are adjacent in Ja- 

Allowing arbitrary signs at off-diagonal entries, we consider the tridiagonal 
isospectral manifold Ta, the set of real symmetric tridiagonal matrices T which are 
conjugate to A. Define the sign sequence of an unreduced matrix T as signseq(T) = 
(sign(T 2 i), sign(T 32 ), . . . , sign(T n! „_i)). The subset of Ta of unreduced matrices 
splits into 2™ _1 connected components according to the sign sequence. Conjugation 
by sign diagonals takes one component to another. Thus, Ta is obtained by gluing 
2™ _1 copies of Ja along their boundaries. 

It is shown in [19] that Ta is a compact orientable manifold by proving that 
simple spectra are regular values of the restriction to T of the ordered spectrum 
map a . 

For A = diag(4, 5,7), Figure [2] shows the manifold 7a, a bitorus. The vector 
space T receives an Euclidean metric via the inner product (Ti,T2) = tr(7iT2)- 
The manifold Ta is then contained in the intersection of the hyperplane of matrices 
of trace 4 + 5 + 7 and the sphere of matrices T with (T,T) — 4 2 + 5 2 + 7 2 : this 
intersection is isometric to a sphere centered at the origin in R . A stereographic 
projection takes this sphere (and its subset Ta) to M 3 : Figure [2] is a snapshot of 
the image of Ta under this projection. The small gaps were artificially introduced: 
these tubular neighborhoods of circles in R 3 split the manifold into the four hexagons 
EJaE, Ee£. 

Figure [3] shows again Ta for this example in a more schematic fashion. The 
four hexagons stand for the components of the subset of unreduced matrices: sign 
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Figure 2: A 3d rendition of 7a for A = diag(4, 5, 7) 



sequences label the hexagons. Vertices are diagonal matrices and edges with the 
same label are identified. 

The bitorus is decomposed as a disjoint union of four open hexagons, six open 
edges and six vertices. We generalize this cell decomposition. Any tridiagonal 
matrix T G 7a splits into unreduced blocks Ti, . . . , along the diagonal. Consider 
the subspectra Aj = diag(cr (Tj)) and the sign sequences signseq(Xi). The (open) 
cell containing T is the subset of 7a. of matrices with the same block partition, 
subspectra and sign sequences as T. The cell containing T is naturally identified 
with ,7a i x • • • x 7a* an d therefore diffeomorphic to M™ _fc , from Theorem 12.21 The 
vertices (or cells of dimension 0) of 7a. are the diagonal matrices A 71 ' and the set 7a 
of Jacobi matrices is the cell of maximal dimension n — 1 defined by signseq(T) = 



The reader should notice that norming constants do not admit a smooth natural 
extension to 7\. Indeed, again in Figure^ norming constants are positive on the H — h 
cell of Jacobi matrices. Crossing the horizontal axis to the cell H — takes the norming 
constant w 2 (associated with A 2 = 5) through so, to guarantee smoothness, we 

should have signs H h for the norming constants in the cell H — . On the other 

hand, we can also go from ++ to +— through the edges a or c and such crossings 
would induce the sign patterns — h + and + H — , respectively. 



(+,+,...,+)• 




Figure 3: Gluing instructions for the manifold 7a 



G 



3 Bidiagonal coordinates 



In this section, we construct an atlas for 7X given by a family of charts indexed 
by the permutations jt 6 S„. Each chart is a diffeomorphism rp^ : U\ — > K" _1 
and the chart domains centered at the diagonal matrices A 17 (in the sense that 
- 7r (A 7r ) = 0), form an open cover of T\. The bidiagonal coordinates f3£ for a matrix 
T G U\ are the entries of the vector ip v (T) . 

For n = 3, A = diag(4, 5, 7) and n given by 7r(l) = 3, 7r(2) = 1, 7r(3) = 2 so that 
A" = diag(7, 4, 5) the set U\ is the interior of the polygon in Figure [3] Bidiagonal 
coordinates were used to produce Figure O 7a was partitioned into six quadri- 
laterals centered at diagonal matrices. This decomposition of 7a has four vertices 
(one in the interior of each of the four hexagons described in the previous section) ; 
each quadrilateral touches each vertex once. In the figure, the small gaps split each 
quadrilateral into four smaller ones; boundaries between quadrilaterals are visible 
as the lines along which the mesh loses smoothness. Lines in each quadrilateral are 
level curves of bidiagonal coordinates f3f. We first define the chart domains U\. 

Definition 3.1 For a permutation n, the chart domain U\ is the set of matrices 
T G 7a admitting a ^-normalized diagonalization, i.e., the matrices T for which 
there exists an LU -positive matrix G O(n) with T — Q^A^Q^. 

We now present some properties of the sets IA\ . 

Lemma 3.2 (a) The sets U\ C 7a form an open cover ofT\. 

(b) IfEeS andT eUl then ETE G U\. 

(c) IfT£T\ is unreduced then T G U\ for all permutations n. In particular, each 
set U\ is dense in 7a- 

(d) Take T G 7X with unreduced blocks T\, . . . ,71- of dimensions n\, . . . ,nu along 
the diagonal. For a permutation n, split A 71 ' in blocks: 

(A 7r )i = diag(A^ lH — hn ._ 1+1 , . . . , A^ lH — \- ni _i+rn)- 

Then T G U\ if and only if Ti is conjugate to (A 7r )i for i = 1, . . . , k. 

Proof: (a) For T G 7" A , write T = Q*AQ for some Q G 0(n). Write the PLU 
factorization of Q, i.e., Q = PLU where P is a permutation matrix, L is lower 
unipotent and U is upper triangular. Notice that this is usually possible for P = P n 
for several permutations ir G S„. Thus, all the leading principal minors of P^Q 
are invertible and there exists E G £ such that Q w = EP~ X Q is L[/-positive and 
T = Q*AQ = Q* A n Qir belongs to U\. The set of LC/-positive matrices is open in 
jgmxn an( j therefore each U\ is also open in 7a- 

(b) If T = <2£A 7r Q 7r is the 7r-normalized factorization of T G U\ then ETE = 
(EQ^E)*A n (EQ n E). The matrix EQ^E is /.[/-positive and therefore ETE G U\. 

(c) The case where T is a Jacobi matrix is discussed in Theorem 12.21 If T is 
unreduced then there exists E G £ such that ETE is Jacobi and item (b) completes 
the argument. 

(d) Consider a permutation tt, T G T\ with unreduced blocks T\, . . . and the 
diagonal blocks (A 7T )i as above. From item (c), if Ti and (A 7r )i are conjugate then 
there exist /[/-positive matrices Qi G 0{n{) with Tj = Q*{A'*) i Q i . Let Q be the 
matrix with blocks Qf. the matrix Q is /[/-positive and orthogonal; T — (Q)*A n Q 
implies T G IA\. 

Conversely, assume that T G U\ admits a block decomposition. Then T = 
(P 7r /[/)- 1 A(P 7r /[/) yields UTU' 1 = L~ X A V L and we therefore have U^lfr 1 = 
L^ 1 (A 7T ) i L i where Ui and Li are blocks along the diagonal for U and L. ■ 
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Item (d) yields an alternative description of the chart domain U\ : it is the union 
of all cells in 7/v whose closure contains A". 

We now construct charts ip^ : U\ — > R ,l_1 and their inverses (j>„. 

Definition 3.3 Given (3^ , . . . , fJ^_i and A 17 = diag(AJ, . . . , X^) build the bidiagonal 
matrix 

( \\ \ 

B n = 02 A 3 

\ fin-l KiJ 

Take the diagonalization B v — L~ X A^L^ where is lower triangular with unit 
diagonal. Define the inverse chart <f>^ : R' i_1 — > U\ by 

■ • • . f%-i) = Q(L«)* A" Q(L W ). 

Notice that 4> n (0) = A v . It is easy to see that a matrix L n as above exists; an 
explicit formula for its entries is given in the proof of Proposition I3.6I The claim 
that U\ is a valid counterdomain for 0^ requires a proof. Set — Q(L n ): the 
matrix T = ^(jj^, ■ ■ ■ , PZ-i) — Qn ^ Qn is clearly symmetric. On the other hand, 
the factorization T — R(L 7r )_B 7r (R(L 7r )) _1 implies that T is upper Hessenberg and 
therefore T £ 7a. Since R(L W ) has positive diagonal and = L 7r (R(L 7r )) _1 , we 
have that the matrix is L[/-positive, T — Q* A 71 " is a 7r-normalized diagonal- 
ization and therefore T 6 U\. 

Definition 3.4 For T £ U\, take its it -normalized diagonalization T — Q^A^Q^ 
and write = h(Q 7r ), = U(Q 7r ) J = U^ 1 . Set 

B?r = R-x 1 TR 7T = L~ A K L„, 

The matrix B w is bidiagonal and its off-diagonal entries f3£ = {Bn)i+i. i, i — 
1, ... ,?i — 1, are the 7r-bidiagonal coordinates of T . The chart ip v : IA\ — > M™ _1 is 
the map taking T to (f3f, . . . , f3„-i)- 

We prove that B^ is indeed bidiagonal. From B^ = R~ 1 TR 7T , B^ is upper 
Hessenberg and from B n = L~ 1 A 7r L 7r , it is lower triangular with diagonal entries 

1 ) " * * ) \i- 

The maps tp v and are clearly smooth. By construction, one is the inverse of 
the other, implying the following result. 

Theorem 3.5 The map <p„ : R" _1 — > IA\ is a diffeomorphism with inverse : 

As an example of bidiagonal coordinates, let A = diag(4, 5,7). Set 7r(l) = 3, 
7r(2) = 1, 7r(3) = 2. Matrices will be described by their 7r-bidiagonal coordinates 
x — [3^ and y — (3%. Since B„ = L^A^L^, we obtain 

h o o\ h o o\ / l o o> 

A 7r = 4 , S w = \x 4 , L w = | -x/3 1 
\0 5/ V° y V \-a*/2 V 1; 

and writing Qrr = T^U^ we have 

2^1 



, / 6r2 6x(2 + 3y 

Q K = -2xr 2 3(12 + x 2 y 2 ] 

riT2 \-3xyr 2 2y(l8-x 2 ) 
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where n = ^36 + 4x 2 + 9x 2 y 2 and r 2 = ^36 + 36y 2 + x 2 y 2 . From T = Q^A^Q^, 
we have T = A" + M/ (rfr 2 ) where 



'-6^ 2 (2 + 3y 2 )r 2 



M 



%xr\ 




§xr\ 



(72 + 108y 2 )r{ - {72-4x')r 

p2 



6yrf 





6yrf 
-2y 2 (18 - x 2 )rl 



This is an explicit parametrization (f> v : WL — > Z/^ C 7a of the polygon in Figure 
[3j From this formula, a; = implies (T)n = 7 and (T)2i = while y = gives 
(^)33 — 5, (T) 32 = 0, consistent with the description of U\ in Lemma [ 



Proposition 3.6 For any 'permutation ir and any Jacobi matrix J £ J a, the norm- 
ing constants Wi and the -k -bidiagonal coordinates /3? are related by 



(K-^)---W-K-i) 



2<i<n, 



Proof: Let A 7 ' and be as above. Set 



«V(i) 



1< i < n - 1. 



L = 



1 







AJ-AJ 



(AJ-AJ)(AJ-AJ) 



••• 

1 ••• 

... i 



A straightforward computation verifies that LB n = A n L and therefore L = L„. 

Norming constants are given by the absolute values of entries in the first column 
of which in turn is the normalization of the first column of L v . I 

Bidiagonal coordinates change signs together with off-diagonal entries in a simple 
fashion. 



Lemma 3.7 If E = diagOx, • ■ ■ , cr n ) £ £, T £ U\, and ip„(T) = {(31, 
then ^^(ETE) = (p\Oifl\i ■ ■ ■ , o~ n ~io~ n Pn-i)- ^ n other words, if and B^ are the 
bidiagonal matrices associated to T and ETE ( as in Definition \3.4\ ), respectively, 
then 4 = EB n E. 

Proof: From Lemma El if T £ U\ then f = ETE £ U\. Clearly, if the tt- 
normalized decomposition of T is T = Q^A^Q^ then ETE = (EQ^E)* A* (EQ^E) 
is the 7r-normalized diagonalization of ETE. Also, ^(EQ^E) = E\j(Q ir )E and 
therefore, if = L(Q 7r ) then B^ = (EL^E)- 1 A 7r (EL^E) = EB^E. U 

Norming constants break down at the boundary of Ja- We will prove in the 
following proposition, however, that near a reduced matrix T with (T)j +1 , = 
the values of (T)j+x,-i and f3f are comparable. This will be useful when we use 
bidiagonal coordinates to study the asymptotics of isospectral maps. 

Proposition 3.8 Given A and ir, the quotient qf : U\ — > K defined by qf(T) = 
/3f(T)/((T) i+ x,i) is smooth, positive and qf (A n ) = 1. Also, qf (ETE) = qf (T) for 
all TeUJ and E £ £ . 
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In particular /3f and (T)i + u have the same sign regardless of n. 

Proof: Clearly, (T) i+M = if and only if ETE = T where E j:j = 1 (resp. -1) 
for j < i (resp. j > i + Thus, from Lemma l3.7( f3?{T) = if and only if 
{T) i+ i,i = 0. 

We study the i-th partial derivative of {^{Pi, ■ ■ ■ , /3£_x))j+i,i when /3f = 0. 
Take a path T : R -> T A , T(f) = 7r (/?] r , . . . , i, . . . , /3£_ x ). From Lemma ET7| all 
entries of T(t) except (i + 1, i) (and i + 1)) are even functions of t and therefore 
the corresponding entries of X"(0) equal 0. On the other hand, since (j>^ is a dif- 
feomorphism, T'(0) must be nonzero. It follows that (T' (0)) i+l i ^ and therefore 
qf{T) is well defined, smooth and nonzero even when the denominator vanishes, 
i.e., at reduced matrices. 

The symmetry property indicated in the last claim follows from Lemma 13.71 
Positivity is obvious for Jacobi matrices, extends to unreduced matrices by sym- 
metry and to reduced matrices by continuity. In order to compute qf (A 71 '), con- 
sider the path T(t) = c/V(0, . . . ,t, . . . , 0) (with t in the i-th position). Clearly, 
B n (t) = A 71 " + t(ej+i)*e, so that /?f(t) = t. A straightforward computation yields 

= 'vl )2 , 2 *? «T(T(t)) = (Ar+1 " Ar)2 + e 



The result now follows by setting t = 0. 



4 Iterations in 



A 



We now apply bidiagonal coordinates to the study of the dynamics of QR type 
iterations. As a simple example, we present in Theorem 14.21 a new proof of the 
well known fact that the Rayleigh quotient shift iteration has cubic convergence. 
A subtler example is the Wilkinson's shift iteration which is studied with the same 
technique in [10] and [TT]. Excellent references for the spectral theory of Jacobi 
matrices are [5], [5] and [13] . 

For an open neighborhood X C 1 of the spectrum {Ai, . . . , A„} = cr(A) and a 
continuous function / : X — > R taking nonzero values on <x(A), there is a smooth 
map F : 7a — ► 7a> the QR step induced by /, given by 

F(T) = Q(/(T))*TQ(/(T)). 

Continuity of / is sufficient to imply that F is a well defined smooth function: 
indeed, if / and the polynomial p coincide on a (A) then f(T) = p(T) for all T G 7a 
and therefore F = P, the QR step induced by p. The standard Qi? step corresponds 
to f(x) — x. Since T and f(T) commute, we also have 



F(T) = K(f(T))TR(f(T))- 1 . 

From the first formula, F(T) is symmetric; from the second, it is upper Hessenberg 
with sub-diagonal elements with the same signs as in T. Thus, F : 7a — > 7a 
preserves J7a, the other cells and the open subsets U\. 

Let F*" = ofo^, : M™ _1 -> M n_1 ; in other words, i*"^" is obtained from 
F\u* by a change of variables using bidiagonal coordinates. 



Proposition 4.1 For f taking nonzero values on the spectrum ofT, 



F+* (&,...,!%-!) = 



f{K(2)) 



PI,- 



f{K{n)) 
f{\r{n-l)) 
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Also, QR iterations generically converge to diagonal matrices. More precisely, if 
TeW h and \f{K {1) )\ > \f(K{2))\ > ■ > \f(K(n))\, then lim fc ^ +00 F k (T) = A" 
with asymptotics 



(3?= lim (F fe (T)) l+1 . 4 



f{K(i)) 



f{K(i+i)) 



Proof: Take T G U\ and set T' 



F(T) = Q(/(T))*TQ(/(T)). We show that T' G 
and relate the corresponding matrices and B'^. Consider the 7r-normalized 
diagonalization T = Q*A*Q^ and write L n = L(Q 7r ). Since f(T) = g;/(A x )Q 7r 
and Q(ZM) = ZQ(Af) for an arbitrary matrix Z G O(n) and invertible M, we 
have T" = (Q)*A"Q where Q = Q.Q(Q;/(A' r )g 7r ) = Q[f(A*)Q«). Take Q; = 
Q(\f\(A 7T )Q 7r ): clearly, Q'^ is L£/-positive and Q' n = EQ for some E e £ and 
therefore T" = (QJ r )*A 7r Q^ r is the 7r-normalized diagonalization of T" eWJ. Write 
L,r = L(Q 7r ), L; = L(<%). Since L(M) = L(Mi?) for arbitrary L£/-positive 
matrices M and invertible, upper triangular R, we have L(M) = L(Q(Af)) and 
thus = Ld/KA^)^^). Notice that if D is an invertible diagonal matrix and M 
is LCZ-positive then L(DM) = DL(M)I)- 1 : we obtain = |/|(A 7r )L 7r (|/|(A 7r ))- 1 
and therefore 

Si = (L'^A'L* = (|/|(A-))B 7r (|/|(A^))- 1 . 

This finishes the proof of the first formula. The convergence properties now follow 
easily from Proposition 13.81 ■ 

This proposition yields yet another evidence for the naturality of the bidiagonal 
coordinates f3£ . 

The cubic convergence to deflation of the QR iteration with Raylcigh quotient 
shift is well known ([33]); using bidiagonal coordinates, we deduce it from a Taylor 
expansion. For s G R, let f s (x) = x — s so that the QR step F s : 7a — > 7a is 
defined for s ^ c(A). In other words, we have a map F : (R \ <r(A)) x 7a — > 7a, 
F(s, T) = F S (T). The map F cannot be continuously extended to R x 7a; it follows 
from Proposition 14. 11 however, that F can be continuously extended to pairs (s, T) 
if s = Xi and the (possibly reduced) matrix T has the eigenvalue Xi in the spectrum 
of its bottom block. More formally, consider the set 



V F = ((R \ (7(A)) x T A ) U |J {A,} x |J 



C R x T A 



7reS„,7r(i) = 



The set X>f is open since points T G {\r(ra)} x admit the explicit open neigh- 
borhood (A£ — 7/2, A^ + 7/2) xW{ C 2>f where 7 = min^ |Aj — Xj | is the spectral 
gap of A. The function F is defined in V-p by 



F( s ,^(/3^,...,/3:_ 1 )) = ^ 



A3 



A? 



XI 



A~_ 



(*) 



The Qi? iteration with Rayleigh quotient shift G : T>q — > 7a is defined by G(T) = 
F((T)„, n ,T) where D G C T A is the open set {T e T A ((!%,„, T) G £> F }- Notice 
that, from a numerical point of view, falling outside T>q is an instant win: (T) n n 
is an eigenvalue. 

The deflation set Aq C 7a is the set of matrices T with (T)„ jn _i = 0. The set 
Ao is the disjoint union of the subsets A of matrices T G Ao with (T)„.„ = A.;. 
Notice that Aq is diffeomorphic to 7a, where Aj = diag(Ai, . . . , A,_x, Ai+i , ... ,X n ) 
and therefore a connected component of Ao. Clearly, Ao C T>g since T„ in _i = 
and T n . n — X 7 ^ imply that T G U\. The (compact) deflation neighborhood A e is the 
set of matrices T G 7a with (T)„ jra _i| < e. 
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Theorem 4.2 There exist e > and c > such that A e C T>c, G(A e ) C A e and 
for all TeA.we have |(G(T)) n , n _ 1 | < c |(T)„,„_ 1 | 3 . 

Proof: Since the chart domains U\ are open dense sets covering 7a, there exist 
compact sets K v C U\ with IJ-n-eS = ^~ A - Take > such that ip w (K w ) is 
contained in the box (— M n , M„.) n_1 : we then have 

Ao= |J ^((-M ff ,M w ) n - 2 x{0}). 

Equation (*) above yields a formula for G^" : P G *» — > M™ -1 where Z?^,, = 
■0^(Pg n^X) C R"- 1 is an open set with R"^ 2 x {0} C V G ^ and 

S = sift, . . = (MPl, ■ ■ ■ 

For each 7r there exist e' n > such that, if = (— M^-, M^)™ -2 X (— e'„,e'„) then 
■ ■ ■ , £ ^tt implies s — A£| < 7/2 (where 7 is the spectral gap) and 
therefore A' w C T>q^ . Due to the presence of absolute values, the function G*" is 
almost certainly not smooth in A'. Set g n : A' v — > R, 

\ n - s 

9ir\Pl i ■ ■ ■ 1 Pn-l) = T"wr r 
l A n-l _ - S ! 

The function is smooth and /9£_i = implies g w (0i, ■ ■ ■ , = 0. Since g is 

even (from Lemma T3.7p . its first order partial derivatives at points with (3^-1 = 
all vanish and the Taylor expansion for at such points starts with terms of degree 
2. By compactness of A'„, there exists a constant c w such that 

g n (fi, 0Z-i) < c^{pl_ x f, \(G**(J%, . . . , #U))„-i| < c^l^f. 

From Proposition 13. 81 /9£_i and (T)„ „_ 1 are comparable: there exists c T such that 
T G 0tt(^) implies 

|(G(T)) 

n,n — 1 ,rt— 1 • 

Take c = max 7re s n and e > such that e < c -1 / 2 and A e C A'. If T G A e we 
therefore have |(G(T)) ni „_x| < c|(T) n! „_i| 3 < e, proving the claims. ■ 



5 To da flows 

Recall that the Toda flow ([6], [13], [4]) solves the differential equation 

J'(t) = [J(t),Tl a (J(t))], J(0) = J . 

Here the bracket is the usual Lie bracket on matrices [A\, A2] = A\A^ — A^A\ and 
H a (M) is the skew-symmetric matrix having the same lower triangular entries as 
M. As is well known, this flow preserves spectrum and the set J7a- If w(t) is the 
vector of norming constants for J(t), we have 

= exp(tA)w(0) 
W[ ' ||exp(tA)«;(0)|| ' 

thus, up to normalization, the function w is the solution of a linear differential 
equation. Taking quotients and using Proposition 13.61 shows that the evolution of 
is truly linear: 
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In other words, B' n = [B n , -A x ]. 

Clearly, the Toda flow is well defined in T\. Similar formulae hold for other 
flows in the Toda hierarchy: for a function g, consider the differential equation 
T = [T, H a g(T)] ; it turns out that, despite g(T) not being tridiagonal, [T,U a g(T)} 
is symmetric and tridiagonal. Integrate the differential equation to define T(t) for 
all tgl. In bidiagonal coordinates, it is easy to compute limits and asymptotics 
of Toda flows. 

Proposition 5.1 In it -bidiagonal coordinates, the equation T' = [T,H a g(T)] be- 
comes a decoupled linear system: 4f/3^ — (ff(AL_ 1 ) — g(XJ))f3f . In particular, if 
T(0) S Ul and g(K{i)) > d{K(2)) > ■■■ > g{K(n)) then lim t ^ +oc /3%(t) = 0, so 
that hm t ^ +00 T(t) = A w with asymptotics 

^fe(O) = t lim (T(t)) ktk+1 exp ((£r(A w ( fe) ) - g(X n{k+1) )) t) . 

Proof: The formula below follows by direct computation ([S], [12], [15], [15]): 

r(t) = Q(exp(^(T(0))))*T(0)Q(exp(t 5 (T(0)))), 
or, in other words, r(i) = F(T(0)) where f(x) = exp(t g(x)). From Proposition ^. 11 

(PUt), #U(t)) = (e^)-»( A f ))*/^(0), . . . , eW^-'WUH'fl^O)) 

and the differential equation for (3f follows by taking derivatives. The last formula 
is now a consequence of Proposition 13.81 ■ 

As an application of the bidiagonal variables we consider the scattering prop- 
erties of the Toda flow. From a more physical point of view, the Toda flow is the 
evolution of n particles of mass 1 on the line given by the Hamiltonian 

n 2 n— 1 

H{x, y) = y + ex P(^ _ x k+i), 

k=l k=l 

where x = (x±, . . . , x n ) and y — (j/i, . . . , y n ) are respectively the positions and 
velocities of the particles. Without loss of generality, 

$>*(*) = £>(*)= 0. 
fc fc 

More explicitly, positions and velocities satisfy the differential equation 

x 'k = H Vk =y k , y' k = -H Xk = exp(a; fc _i - x k ) - exp(ir fc - x k +i), k=l,...,n, 

where we take the formal boundary conditions xq = — oo, = oo. The two 

versions of the Toda flow are related by Flaschka's transformation ([6]): 

T 1 j 1 f x k - x k+ i 
Jk.k = ~2 yk > J k,k+i = g ex P I 2 

Notice that tr(J) = for the Jacobi matrix J constructed from x and y. 

We know that J(t) tends to A 71 "* when t — > ±oo where tt~ is the identity permu- 
tation and 7r + is the reversal 7r + (fc) = n + 1 — fc. The convergence of the diagonal 
entries implies that the velocities yk(t) approach — 2X K± / k \. From the convergence 
to of the off-diagonal entries, the force ±exp(x k — x k +i) between particles fc and 
k + 1 tends to when t — » ±oo. Thus, asymptotically, the particles undertake 
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independent uniform motions of the form Xk(t) « c fc^ + ^fc when t — > ±00. Clearly, 
= — 2A 7r± ( fe -). The wave and scattering maps are 

^ ± (x 1 (0),..., : r n (0),y 1 (0),...,y„(0)) = (c±,..., Cl t,d 1 : ,...,d I t), 
^(cj , . . . , c n , d j , . . . ,d n ) = (c l , . . . , c^, <ij , . . . , d^), 

respectively. They are related by 5 o = Moser ([13]) proved that S is 

indeed well defined and computed it. We obtain Moser 's result by first computing 
wave maps in bidiagonal coordinates. 

Proposition 5.2 Given initial conditions (a^x(O), . . . , a; n (0), yi(0), . . . , y n (Q)) with 
^iEfc(O) = ^2yk(Q) = 0, apply Flaschka's transformation to obtain J(0) with 
eigenvalues Ai < • • • < A n and bidiagonal coordinates 01 ± for 7r_ (k) = k and 
7r+(fc) = n + 1 — k. Then 

4 = -2A ff±(fc) , ^ = £ + 2 + (n - 2* + 1) log2, 

j<fc j>fc 

where 0^ — log /3£ ± . The scattering map is given by 

c i+i-k = c k > 4+i-fc = + 2 E lo s l c 7 - c fc I - 2 X! lo e l c 7 - c fe I • 

j<k j>k 

Proof: From Proposition 15. II with g(z) = z, 

Jk,k+i = ^ exp ( — ^ k+1 ^j ~ exp ((A 7r±(fc+1) - A 7r±(fe) )i) 

in the sense that quotients tend to 1 when t — » ±00. Taking logs, 

{x k - Xk+i) - 2(A 7r±(fc+1) - K ±(k ))t w (d± - = 2/3£ ± + 2 log 2, 

where now the difference goes to zero. Since J^ fc Xfe = 0, we have ^\ = and 
the formula for the wave operator follows. 

Proposition 13.61 yields j3~£ = —(3~_ k + 5k where 

yj>n—k j<n—k J \ j>n+l— fe j<n+l— & 

and = log A^-^) — Kv~(j) |- Use this equation to write 



n J ' — ' n 

\3<k J>k 

+ E + E ^T^^ + (n - 2fc + 1) log 2. 

j<k j>k 

Replacing k by r(k) = n + 1 — k in the equation for g?^~ yields 

c+i-* = -[E ^t^- + E ^r^^- 1 - - 2 ^ + 1) 2 

and therefore 

-2j. , ^2(n-j) 



4 = «c+i- fc + E + E ~Sr^ + 2 ( n - 2fc + lo § 2 - 

The simplification yielding the scattering map is now an easy exercise. 
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